Model adaptation and adaptive training using ESAT algorithm for HMM-based speech synthesis

نویسندگان

  • Juri Isogai
  • Junichi Yamagishi
  • Takao Kobayashi
چکیده

In speaker adaptation for HMM-based speech synthesis, model adaptation and adaptive training techniques play key roles. For reducing dependency on an initial model and adapting the model to wide-ranging target speakers, we propose speaker adaptation and adaptive training algorithms based on ESAT algorithm for HMM-based speech synthesis. The ESAT algorithm estimates contributing rate of several given initial models and combines them depending on likelihood of adaptation data for the target speaker. In this study, we incorporate the ESAT algorithm into a framework of hidden semi-Markov model (HSMM) to adapt both state output and duration distributions and convert both voice characteristics and prosodic features. From the results of subjective tests, we show that the ESAT algorithm lessen the dependence of synthetic speech quality on the initial model and has the potential ability for a wider range of the target speakers.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

HMM-based polyglot speech synthesis by speaker and language adaptive training

This paper describes a technique for speaker and language adaptive training (SLAT) for HMM-based polyglot speech synthesis and its evaluations on a multi-lingual speech corpus. The SLAT technique allows multi-speaker/multi-language adaptive training and synthesis to be performed. Experimental results show that the SLAT technique achieves better naturalness than both speaker-adaptively trained l...

متن کامل

Speaker-Independent HMM-based Speech Synthesis System

This paper describes an HMM-based speech synthesis system developed by the HTS working group for the Blizzard Challenge 2007. To further explore the potential of HMM-based speech synthesis, we incorporate new features in our conventional system which underpin a speaker-independent approach: speaker adaptation techniques; adaptive training for HSMMs; and full covariance modeling using the CSMAPL...

متن کامل

Speaker-adaptive visual speech synthesis in the HMM-framework

In this paper we apply speaker-adaptive and speakerdependent training of hidden Markov models (HMMs) to visual speech synthesis. In speaker-dependent training we use data from one speaker to train a visual and acoustic HMM. In speaker-adaptive training, first a visual background model (average voice) from multiple speakers is trained. This background model is then adapted to a new target speake...

متن کامل

An improved minimum generation error based model adaptation for HMM-based speech synthesis

Aminimum generation error (MGE) criterion had been proposed for model training in HMM-based speech synthesis. In this paper, we apply the MGE criterion to model adaptation for HMM-based speech synthesis, and introduce an MGE linear regression (MGELR) based model adaptation algorithm, where the regression matrices used to transform source models are optimized so as to minimize the generation err...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005